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ABSTRACT 

A variety of methods have been proposed to define and to quantify galaxy environments. While these 
techniques work well in general with spectroscopic redslrift samples, their application to photometric 
redshift surveys remains uncertain. To investigate whether galaxy environments can be robustly 
measured with photo-z samples, we quantify how the density measured with the nearest neighbor 
approach is affected by photo-z uncertainties by using the Durham mock galaxy catalogs in which the 
3D real-space environments and the properties of galaxies are exactly known. Furthermore, we present 
an optimization scheme in the choice of parameters used in the 2D projected measurements which yield 
the tightest correlation with respect to the 3D real-space environments. By adopting the optimized 
parameters in the density measurements, we show that the correlation between the 2D projected 
optimized density and real-space density can still be revealed, and the color-density relation is also 
visible out to z ~ 0.8 even for a photo-z uncertainty (cta z /(i+z)) u P to 0.06. We find that at the redshift 
0.3 < 2 < 0.5 a deep (* ~ 25) photometric redshift survey with ua z /(i+z) = 0.02 yields a comparable 
performance of small-scale density measurement to a shallower i ~ 22.5 spectroscopic sample with 
~ 10% sampling rate. Finally, we discuss the application of the local density measurements to the 
Pan-STARRSl Medium Deep survey, one of the largest deep optical imaging surveys. Using data 
from ~ 5 square degrees of survey area, our results show that it is possible to measure local density 
and to probe the color-density relation with 3er confidence level out to z ~ 0.8 in the PS-MDS. The 
color-density relation, however, quickly degrades for data covering smaller areas. 

Subject headings: galaxies: evolution - galaxies: formation - galaxies: halos 


1. INTRODUCTION 

Recent observations have shown that various galaxy 
properties such as star formation rate, color and mor¬ 
phology are strongly correlated with galaxy environment 
(Hogg et al. 2004; Blanton et al. 2006; Haines et al. 2006; 
Cooper ct al. 2006, 2007; Capak et al. 2007; Elbaz et al. 
2007; Gallazzi ct al. 2009; Cooper ct al. 2012; Mostek 
et al. 2013; Lin ct al. 2014; Darvish et al. 2014). These 
studies indicate that galaxies located in dense environ¬ 
ments, such as galaxy groups and clusters, tend to be 
redder, elliptical and with lower star formation rates. 
Several physical processes, including ram pressure strip¬ 
ping (Gunn & Gott 1972; Quilis et al. 2000), high speed 
galaxy encounters (galaxy harassment; Moore et al. 
1996), galaxy-galaxy mergers (Mihos & Hernquist 1994), 
and removal of warm and hot gas (strangulation; Lar¬ 
son ct al. 1980; Balogh et al. 2000; McCarthy et al. 2008) 
have been proposed to explain the observed relation be- 
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tween environment and galaxy properties. Yet exactly 
how the environment affects the evolution of galaxies and 
how important it is as opposed to internal properties of 
galaxies (e.g., stellar mass) is still unclear. Part of the 
discrepancy between previous studies may come from the 
differences in the sample selection as well as the definition 
of environment, which make the comparisons non-trivial. 

One of the common approaches to characterize the 
galaxy environment is to use the local overdensity of 
matter. For the rest of this paper, we refer the galaxy 
environment to the observed overdensity of galaxies as 
a proxy for this. A variety of methods have been used 
to define galaxy density field, for example, (1) the Fixed 
Aperture method which counts the number of neighbor 
galaxies in a fixed volume around each galaxy (e.g., Gal¬ 
lazzi et al. 2009; Griitzbauch et al. 2011); (2) the Annu¬ 
lus method which counts the number of neighbor galaxies 
within a circular ring around each galaxy (e.g., Wilman 
et al. 2010); and (3) the IV th nearest neighbor that de¬ 
fines the local density by finding the distance from the 
individual reference galaxies to the N th nearest galaxy 
(e.g., Casertano & Hut 1985; Gomez et al. 2003; Baldry 
et al. 2006; Haas ct al. 2012). A fundamental and cru¬ 
cial quantity for these methods to work is the redshift 
information, which provides the information about the 
line-of-sight separation (in the absence of peculiar veloc¬ 
ities) of two given galaxies. Observationally, there are 
two types of redshift that are used widely: spectroscopic 
redshifts and photometric redshifts (hereafter spectral-z 
and photo-z respectively). While the spectral-z samples 
have greater precision in the redshift measurement, they 
suffer from incompleteness and are observationally ex- 
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pensive for high-redshift galaxies. 

To date, environment studies using large spectroscopic 
surveys such as SDSS (York ct al. 2000), DEEP2 (Davis 
ct al. 2003; Newman et al. 2013) and zCOSMOS (Lilly 
et al. 2007) have been limited to redshifts lower than 
z ~ 1.5 (see Tanaka et al. 2004; Haines ct al. 2006; 
Cooper et al. 2006, 2007; Mostek et al. 2013). In contrast, 
photo-z surveys provide larger sample sizes and reach to 
higher redshifts, but they suffer from poorer redshift res¬ 
olution. Figures 1-3 show how the photo-z uncertainties 
distort the real galaxy environment from different view¬ 
points. The large-scale structures are clearly revealed in 
the case without photo-z error but become less promi¬ 
nent as the photo-z error increases. Despite this prob¬ 
lem, there have been some attempts to measure galaxy 
environment for various studies using photo-z samples 
(Capak et al. 2007; Quadri et al. 2012; Scoville ct al. 
2013; Chiang et al. 2014; Lin ct al. 2016). Several works 
have provided viable methods that can be used to re¬ 
cover the density fields of galaxies from photo-z samples 
(Ethcrington & Thomas 2015; Lin ct al. 2016; Malavasi 
et al. 2016). Moreover, Arnalte-Mur et al. (2009) and 
Schlagenhaufer ct al. (2012) both demonstrated that the 
two-point correlation function of galaxies can also be suc¬ 
cessfully recovered from photometric samples, and they 
also discussed the influence of photo-z errors on their 
measurements. 

As several ongoing large sky surveys such as the 
Panoramic Survey Telescope & Rapid Response System 
(Pan-STARRS: Onaka et al. 2008; Kaiser et al. 2010), 
Dark Energy Survey (DES: The Dark Energy Survey Col¬ 
laboration 2005; Albrecht et al. 2006), Hyper Suprime- 
Cam Survey (HSC: Miyazaki et al. 2012) and the up¬ 
coming Large Synoptic Survey Telescope (LSST: LSST 
Science Collaboration et al. 2009) will yield large galaxy 
samples with photometric redshift measurements, it is 
important to understand the potential and limitation 
of the photo-z method in the studies of galaxy evolu¬ 
tion, especially the environmental effects. Can we use 
photo-z samples to measure environment reliably? What 
are the systematics in the environment measurement be¬ 
tween spectral-z samples and photo-z samples? What is 
the optimal choice for density measurement that can re¬ 
liably recover the underlying environments? These are 
the questions that we aim to answer. Particularly, we 
focus on the measurement of galaxy density field. We 
first study the difference between 3D real-space density 
and 2D projected density measurements by using mock 
galaxy catalogs. We adopt the Spearman’s rank correla¬ 
tion coefficient (Spearman 1904), r s , as a measure of the 
correlation between the 2D and the 3D real-space den¬ 
sity. The optimized parameters for the density measure¬ 
ment are obtained by maximizing r s . We then use the 
results of optimized density measurements to show the 
dependence of galaxy properties on environments from 
the mock catalog. And finally, we apply our optimized 
scheme to the Pan-STARRSl data and compare the re¬ 
sults with the measurements by Cooper et al. (2006) who 
use the DEEP2 spectroscopic sample in the same field. 

This paper is structured as follows. In Section 2.1 and 
2.2 we describe the simulation and observational data 
used in our study. The environment measurements used 
in this study are introduced in Section 3. In Section 4 we 
compare the 3D real-space density with 2D projected en¬ 


vironment, and demonstrate how to optimize the choice 
of N th nearest neighbor to improve 2D projected density 
measurement. In Section 5 we show the relation between 
galaxy environment and galaxy properties in the mock 
galaxy catalog to verify whether or not our optimized 
scheme is applicable. We discuss several possible factors 
that might limit our optimized scheme and apply it to 
observations in Section 6. Finally we summarize our re¬ 
sults in Section 7. In this paper, we adopt the following 
cosmological parameters: Hq = lOOh km s -1 Mpc -1 , 
h = 0.73, D 0 = 0.25 and fl A = 0.75. 

2. DATA 

2.1. Simulation Data 

In this work, we use a theoretical mock galaxy catalog 
to understand the systematics in the local density esti¬ 
mates. The advantage of using a mock galaxy catalog 
compared to real spectroscopic survey data is that the 
real-space density can be directly measured and com¬ 
pared with the projected density. Moreover, the mock 
sample does not suffer from the incompleteness which 
often affects real observations. On the other hand, one 
needs to be cautious when interpreting the results since 
the properties of galaxies in the simulation may not be a 
perfect representation of the real Universe. 

The mock galaxy catalog used in this work is built 
based on Millennium simulation with N = 2160 3 in a 
box with volume = 500 3 /i -3 Mpc 3 from redshift z = 127 
to the present day at z = 0 by adopting the following 
cosmological parameters: a baryon matter density fib 
= 0.045, a total matter density D 0 = 0.25, a dark en¬ 
ergy density D A =0.75 and a Hubble constant H 0 = lOOh 
km s -1 Mpc -1 where h = 0.73. These cosmological pa¬ 
rameters match the first-year results of Wilkinson Mi¬ 
crowave Anisotropy Probe (WMAP: Spergel et al. 2003 ). 
Galaxies are put into halos using the GALFORM semi- 
analytical model (Cole ct al. 2000 ) which takes into ac¬ 
count various galaxy formation processes including gas 
accretion and cooling, star formation in galactic disks 
and galaxy mergers. The mock catalog adopts the Lagos 
et al. (2012) model which takes advantage of the exten¬ 
sion to the treatment of star formation introduced into 
GALFORM in Lagos et al. ( 2011 ) to populate galax¬ 
ies, and is then assembled into a lightcone (Merson et al. 
2013 ). Further detailed information is given in Jolc ct al. 
( 2000 ), Springel et al. ( 2005 ), Bower et al. ( 2006 ), Lagos 
ct al. ( 2011 , 2012 ) and Merson et al. ( 2013 ). 

We constructed two types of mock catalogs that mimic 
the observed spectral-z and photo-z catalogs. The mock 
spectral-z catalog can be obtained from primitive simula¬ 
tion data which stores the intrinsic line-of-sight positions 
of galaxies. To generate mock photo-z catalogs, we per¬ 
turb the position of galaxies along the line-of-sight direc¬ 
tion by making a random shift which follows a Gaussian- 
distribution with standard deviation that matches the 
photo-z error in each case, in order to simulate cases 
with observed redshift uncertainties. The new redshift 
obtained can be viewed as the ’’observed redshift”, and 
be used to compute the local density. Although the pho¬ 
tometric redshift model adopted here is oversimplified 
as it does not take into account the effect of catastrophic 
redshift failures, this simplistic model allows us to under¬ 
stand the effect of redshift dispersion. Later in Section 
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5.4, we consider more realistic situations in which the 
outlier effect is included. In this study we consider sev¬ 
eral photo-z cases with uncertainties of 0 ^/( 1 +*) = 0.00, 
0.02, 0.04 and 0.06. We restrict our environment study to 
the redshift range of 0.3 < z < 0.5 for most of our anal¬ 
ysis. A central area of catalogs ~ 16 square degrees is 
selected for our studies, containing ~ 1, 900,000 galaxies 
with i < 25.8. It is worth noting that the redshift which 
we use for computing the 3D overdensity with the Mil¬ 
lennium mock spectroscopic catalogs refers to the intrin¬ 
sic redshift of galaxies, which does not include the effect 
from peculiar velocity. Therefore, the results shown for 
the spectroscopic redshift sample may be too optimistic. 
However, since our main focus is to understand the per¬ 
formance of density recovery in the case of photometric 
errors, this mock spectroscopic catalog does provide a 
’real’ answer of the 3D density. 

2.2. Observation data 

2.2.1. Spectroscopic Observation 

The DEEP2 Galaxy Survey (Davis et. al. 2003 ; New¬ 
man et al. 2013 ) was designed to study the galaxy pop¬ 
ulation and large-scale structure at z ~ 1. It uses 
the Keck II telescopes with the DEIMOS spectrograph 
(Faber et al. 2003 ) and covers ~ 3.5 square degrees of the 
sky with measured spectra and has targeted ~ 60,000 
galaxies down to a limiting magnitude of Rab < 24.1. 
About ~ 60% of the galaxies are sampled over the red¬ 
shift interval 0.2 < z < 1.4. The overall redshift success 
rate is about ~ 70%. DEEP2 comprises four widely sep¬ 
arated fields. One of the DEEP2 fields, the Extended 
Groth Strip (EGS), is enclosed by the Pan-STARRSl 
Medium Deep Survey Field (MD07). In this study we 
match the DEEP2 spectroscopic redshift catalog to the 
Pan-STARRSl MD07 catalog. For galaxies that are 
common in the two catalogs, we compare the local den¬ 
sity measurements computed using the DEEP2 spectral-z 
and Pan-STARRSl photo-z respectively (see Section 6) 

2.2.2. Photometric Observation 

Pan-STARRSl (hereafter PS1) is a 1.8 meter telescope 
equipped with a CCD digital camera with 1.4 billion pix¬ 
els and 3-degree field of view, located on the summit of 
Haleakala on Maui in the Hawaii Islands (Onaka et al. 
2008 ; Kaiser et al. 2010 ). The PS1 observations are ob¬ 
tained in a set of five broadband filters, which we have 
designated as g p \, r p \ , i v \ , z p \ and y p i. There are two 
major components of the PS1 survey which started ob¬ 
servations in 2010: the 37r survey and the Medium Deep 
Survey (MDS) which comprises ten fields spread across 
the sky. One of the MDS fields, namely MD07, is cho¬ 
sen for this study because it overlaps with the EGS field 
which has the spectroscopic from the DEEP2 survey, and 
enables a direct comparison with the environment mea¬ 
surements using the DEEP2 spectral-z sample (Cooper 
et al. 2006 ). Photo-z in MD07 are computed by running 
the EAZY code (Brammer et al. 2008 ) on PS1 5-band 
photometry plus the u*-band data taken by Eugene Mag- 
nier et al. with CFHT MEGACAM as part of the PS1 
efforts. Comparisons against the DEEP2 spectroscopic 
redshifts (Newman et al. 2013 ) show that the PS1 photo- 
z reaches an uncertainty of 0.05 with outlier rate of 7% 


down to r p i < 24.1. More details on the data process¬ 
ing and photo-z characteristics in the PS1 MD07 sample 
are given in Lin et al. (201 !), and Foucaud et al. (in 
preparation). 

3. GALAXY ENVIRONMENT MEASUREMENTS 

In this study, we adopt the V th Nearest Neighbor 
method to quantify galaxy environment. This method 
defines the local density of each galaxy using the dis¬ 
tance to the IV th nearest neighbor galaxy. In other words, 
whether the galaxy is located in an over-dense or under- 
dense environment depends on how far it is to its V th 
nearest galaxy. In simulations, the cosmological redshift 
reflects the ’’real” distance along the line of sight, en¬ 
abling the environment measurement to be evaluated by 
using 3D V th Nearest Neighbor method. On the other 
hand, observationally, the local density is often estimated 
by using a projected method as the measured redshift is 
a combination of both the cosmological distance and the 
peculiar velocity. 

Here we define two sets of galaxy samples: the primary 
and the secondary sample. The primary sample contains 
galaxies brighter than a particular magnitude (mf), and 
is used when presenting the results. The secondary sam¬ 
ple refers to galaxies used in the search for neighbors, and 
is restricted to those galaxies which are brighter than a 
particular magnitude (mf). The limiting magnitude of 
the secondary sample is particularly important because 
it sets the galaxy number densities in the calculation of 
density field. Figure 4 shows the median distances to 
the 3D V th nearest neighbor with various choices of m*. 
The coloured shadows show the range between the min¬ 
imum and maximum distances to the 3D V th nearest 
neighbor. There are several parameters that should be 
considered in the V th Nearest Neighbor method: (1) the 
choice of the V th Neighbor, which represents the scale 
of environment, (2) the magnitude limit of the primary 
sample (to^), (3) the magnitude limit of the secondary 
sample (?nf), and (4) the velocity window (V^ ut ) that de¬ 
fines the redshift boundaries of the neighbors considered 
in the 2D projected method. These parameters should be 
adjusted according to different science goals and galaxy 
samples. One of the goals of this work is to provide an 
empirical framework which determines these parameters 
by calculating galaxy densities with different combina¬ 
tions of parameters in order to understand the influence 
of the parameters. 

3.1. 2D Projected N th Nearest Neighbor Galaxy 
Environment 

In the 2D projected method, the local density of each 
galaxy is computed as the surface density averaged over 
the area enclosed by the V th closest galaxy within the 
velocity interval V cut : 


where n is the N th closest galaxy for each reference 
galaxy and r n is the distance from the reference galaxy to 
the V th closest galaxy on a 2D surface. There is no sim¬ 
ple way to determine the choice of velocity window V); U t 
in the 2D nearest-neighbor method. In principle, it is not 
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meaningful to adopt a Vut that is too small compared to 
the redshift uncertainty of the data. Conversely, adopt¬ 
ing a large velocity cut enlarges projection effect, which 
leads to greater errors in the density measurement. For 
instance, the separation between two galaxies that are 
close in the projected plane may actually be widely sep¬ 
arated in the third dimension, and vice versa (Muldrew 
et al. 2012). Previous studies have utilized the velocity 
interval Vut of a value close to the distance uncertainties 
in the line-of-sight direction, as they found that the den¬ 
sity estimate does not significantly vary when changing 
Vut around this value. We will further test this approach 
using the mock catalog in Section 4.1. 


3.2. 3D IV th Nearest Neighbor Galaxy Environment 

The galaxy environment defined by the 3D N th Near¬ 
est Neighbor method is similar to that defined by the 2D 
projected N th Neighbor except that the projected cir¬ 
cular area is replaced by the enclosed-spherical volume. 
The volume density of galaxies is evaluated using 3D N th 
Nearest Neighbor method as: 


Pn — 


n + 1 
(4/3)7rr3 ’ 


( 2 ) 


where n is the N th closest galaxy of each reference galaxy 
and r n is the distance from the reference galaxy to the 
N th closest galaxy in the three-dimensional space. We 
compute the real-space density using the 3D iV th Near¬ 
est Neighbor method of the simulation where informa¬ 
tion about the three-dimensional positions of galaxies is 
known. We treat the 3D density as the ’’true” density to 
be compared with the 2D density to quantify how well 
the real-space density can be recovered by the 2D pro¬ 
jected density under various conditions. We note that 
practically the 3D density is rarely used even in a spec¬ 
troscopic redshift sample, because the observed ’redshift’ 
includes contributions from both the Hubble flow and the 
peculiar velocity of galaxies, which is not possible to dif¬ 
ferentiate observationally. 

Finally, in order to contrast the most-dense environ¬ 
ments with the least-dense environments, we convert the 
initial primordial density into an overdensity. The over¬ 
density is conventionally defined as the initial primordial 
density divided by the median density as follows: 


1 + A 


A 

D Mdn ’ 


( 3 ) 


where A is the measured density of a galaxy (i.e., A G 
{p n , E„}), and DMdn is the median density computed by 
counting galaxies within a bin of Az = 0.04. The term 
1 + S n is the so-called overdensity, and S n can be 5%° or 
5 2n , depending on A = p n or E n . 


4. QUANTIFYING DIFFERENCES BETWEEN 
ENVIRONMENT MEASUREMENTS 

In this section, we compare the 2D projected den¬ 
sity obtained under various conditions to the 3D real- 
space density. To quantify their differences, we adopt 
the Spearman’s rank correlation coefficient, r s , which is 
commonly used to measure the strength of a relationship 
between two ranked variables (Spearman 1904; Curran 


2014). The Spearman’s rank correlation coefficient is de¬ 
fined as: 


r s = 1 — 


S (s 2 -1)’ 


( 4 ) 


where di is the difference between the ranks of the two 
variables and s is the sample size. A coefficient r s 
= 0 corresponds to no correlation between two vari¬ 
ables, while r s = 1 (-1) corresponds to a perfect-positive 
(perfect-negative) correlation. In our analysis, we first 
measure the 2D projected density E n as well as the 3D 
real-space density p n for each galaxy, and then convert 
all the density measurements into an overdensity as de¬ 
fined in the last section. After ranking the 3D real-space 
overdensity and 2D projected overdensity, we compute 
the difference di between the ranks of the two overden¬ 
sities, and then use Formula 4 to calculate Spearman’s 
rank correlation coefficient r s . 

Figure 5 is an example showing the correlation between 
the 3D and 2D measurements using JV 2 d = 6, 30, 60 and 
90 respectively, and how the r s coefficient changes with 
different choices of N 2 d- 


4.1. 3D Galaxy Environment vs. 2D Projected Galaxy 
Environment with Different Parameters 

We first probe the effect of the velocity interval Vut 
on the 2D projected density measurement. We restrict 
the sample with m? < 25 and m® < 25 when calculat¬ 
ing 2D projected overdensity, 1 + Sq D , and 3D real-space 
overdensity, 1 + 5% D . Figure 6 shows the scatter plots of 
the 2D projected overdensity versus the real-space over¬ 
density in log scale using the 6 th nearest neighbor. Here 
we consider the following four different choices of Vut in 
the 2D projected measurement for a galaxy sample with 
photo-z error = 0.04(1 + z): ±0.005(1 + z). ±0.02(1 ± 2 ), 
±0.04(1 ± z) and ±0.06(1 ± z). It can be seen that the 
difference in r s among the four cases of Vut is not sig¬ 
nificant when the size of velocity interval is close to the 
photo-z error. For example, the r s of 0.434 is obtained 
in the case of Vut = ±0.02(1 ± z) (upper-right panel of 
Figure 6), while the value of r s increases to 0.473 in the 
case of V cut = ±0.06(1 ± z) (bottom-right panel of Fig¬ 
ure 6). The difference in r s is small (^0.039) between 
these two cases even though the Vut differs by a fac¬ 
tor of 3, which confirms the finding in previous studies 
that the 2D projected measurement is not sensitive to 
Vut when Vut is comparable to the photo-z uncertainty 
(Gallazzi et al. 2009; Muldrew et al. 2012). We further 
repeat similar exercises using samples with a larger red¬ 
shift uncertainty up to 0.08(1 ± 0 ) and at higher redshifts 
(0.6 < 2 < 0.8) and find that this conclusion still holds. 
Therefore throughout this work, we set Vut to be the 
typical photo-z uncertainty of the galaxy sample. 

Next we consider the effect of the secondary magni¬ 
tude limit employed on the galaxy sample when search¬ 
ing for neighbors. A brighter (fainter) to* probes a larger 
(smaller) scale of environment for a fixed N th nearest 
neighbor. It is therefore expected that the correlation 
between the 2D projected and 3D real-space environ¬ 
ment could depend on the choice of to*. Again we se¬ 
lect the sample with photo-z error = 0.02(1 ± z) and set 
Vut = ±0.02(1 ± z) for the reason given above. Figure 
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7 shows the scatter plots of the 2D projected overden¬ 
sity versus real-space overdensity using the 6 th nearest 
neighbor, and both 2D and 3D environments are mea¬ 
sured using mf < 21, mf < 23 and mf < 25 respectively. 
As it can be seen, the largest r s is obtained when the mf 
in 2D measurement is equal to the mf in 3D measure¬ 
ment. Furthermore, for identical mf used in the 2D and 
3D measurements, the environments measured by using 
fainter mf (and hence smaller scales) have better corre¬ 
lation than those measured using a brighter mf. This is 
consistent with the results from Shattow et al. (2013), 
which also shows that for samples with photo-z error, 
the 2D projected environments have a weaker correla¬ 
tion with 3D real-space environments on larger scales. 

Finally, Figure 8 shows the 2D versus 3D scatter 
plots in the density measurement to understand how 
the galaxy environment is affected by the photo-z errors. 
Here we consider mf < 25 and mf < 25, and vary photo- 
z error from 0, 0.02(1 + 2 ), 0.04(1 + 2 ) to 0.06(1 + 2 ). V/ Ut 
is correspondingly set to be ±0.001(1 + z), ±0.02(1 ± z), 
±0.04(1 ± z) and ±0.06(1 ± z) respectively. It is worth 
noting that even under the perfect situation where the 
redshift error is zero, the correlation is still not per¬ 
fect owing to the projection effect. The correlation be¬ 
tween 3D real-space and 2D projected environments be¬ 
come gradually worse when the photo-z error increases. 
However, there still exists some correlation especially for 
galaxies located in high density regions, while the envi¬ 
ronment in lower and intermediate densities is less distin¬ 
guishable. This is consistent with the result from Capak 
et al. (2007), who also shows that the galaxy environ¬ 
ment is difficult to measure for galaxies located in low 
dense regions when a redshift error is present. 

4.2. Optimizing 2D Environment Parameters 

So far we have compared 3D real-space environments 
with various 2D projected environments to show their 
correlation and we have adopted r s to quantify the good¬ 
ness of the correlation. We now expand this to construct 
an optimization scheme to determine the value of A^d 
which gives the best correlation (largest r s ) between the 
2D and 3D environments. Figure 9 shows the r s cal¬ 
culated by fitting 2D projected and 3D real-space envi¬ 
ronment with various choice of A^d and AI 3 n- The red, 
green, blue, cyan and magenta dots are for the cases 
where the 2D projected environments are calculated us¬ 
ing A^ 2 d = 6, 30, 60, 90 and AT 3 d respectively. We also 
mark the four cases of Figure 5, r s = 0.425, 0.455, 0.503 
and 0.549 respectively, to demonstrate how r s varies with 
different choices of A^d- In the following analysis, we 
only show the optimized choice of A^d corresponding to 
the case which yields the largest r s , as a function of A7 3 d- 

Figure 10 shows the largest r s (upper-panel) and cor¬ 
responding choice of N 2 d (bottom-panel) that yields the 
best correlation between 2D projected and 3D real-space 
environments for different choices of A^d, as a function 
of AT 3 d from mock galaxy catalogs. The red, green, blue 
and cyan dots are for samples with different photo-z er¬ 
rors: 0.00, 0.02(1+2), 0.04(1+2), 0.06(1+2) respectively. 
As expected, the densities are relatively easier to recover 
for samples with lower photo-z errors than those with 
higher photo-z errors. The r s obtained for the case with¬ 
out photo-z error (red dots) are greater than 0.9, mean¬ 
ing that the optimized 2D projected environments are 


strongly correlated with the 3D real-space environments. 
However, for the samples contaminated by photo-z errors 
(green, blue and cyan dots), the recovering performance 
becomes gradually worse as the photo-z error increases. 
In addition, the correlation between 2D projected and 
3D real-space environments does not depend only on the 
redshift accuracy, but is also scale dependent. For ex¬ 
ample, at a given photo-z error, r s decreases with A^d, 
which suggests that small-scale environments are more 
easier to recover. A possible explanation is that the 2D 
projected environments calculated by using photo-z sam¬ 
ples might include more contaminations when we probe 
the larger scale of environments. 

One interesting feature in the bottom panel of Figure 
10 is that the best choices of N 2 d are in general not 
equal to A^ 3 d, but only half of AT 3 d, for producing the 
largest r s except for the case of error-free. However, we 
note that the relation between the optimized N 2 d and 
A r 3 n depends on the choices of redshift interval. Detailed 
discussions are given in Appendix A. 

As the value of density measurements also depends on 
the choice of the secondary magnitude limit, it is interest¬ 
ing to see how the correlation between 2D and 3D mea¬ 
surements change by varying the secondary magnitude 
limits in both 3D and 2D environment measurements. 
Figures 11 - 13 show the largest r s as a function of A^ 3 d 
for different 3D secondary magnitude limits respectively. 
In each figure, the red, green and blue dots are for sam¬ 
ples with secondary magnitude limits corresponding to 
mf < 21, mf < 23, and mf < 25 in 2D environment 
measurement respectively, at a fixed secondary magni¬ 
tude limit in 3D local density and a fixed photo-z error 
= 0.02(1 + 2 ). Our results show that when the 3D envi¬ 
ronment is defined using brighter secondary magnitude 
limits, there is no significant difference in the recover¬ 
ing performance among different choices of 2D secondary 
magnitude limit that are fainter than the 3D secondary 
magnitude limit. On the other hand, adopting a 2D sec¬ 
ondary magnitude limit that is brighter than the 3D sec¬ 
ondary magnitude limit results in a poorer environment 
recovery. This means that a deeper sample is favored 
when constructing the 2D density field. 

5. THE CORRELATION BETWEEN 
ENVIRONMENT AND GALAXY PROPERTY 

In Section 4 we optimized the choice of N 2 d for the 2D 
projected density measurement to yield the best corre¬ 
lation with the 3D real-space environments by using the 
r s metric. In this section, we use these optimized results 
to study how the color-density relation in the simula¬ 
tion changes when varying the photo-z uncertainties and 
outlier rates. Although the density-color and/or halo 
mass-color relations seen in the simulations may not fully 
represent the observed Universe, this provides us with a 
guideline to understand how reliably we can study the 
dependence of galaxy properties on environment using 
the photo-z samples. 

5.1. Environment vs. Galaxy Color 

To explore the relation between galaxy color and en¬ 
vironment, we compare the apparent magnitude i versus 
g — i colors of galaxies located in the 20% most dense 
and the 20% least dense galaxy environments respec- 
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tively. We note that although conventionally the color- 
magnitude relation is defined in the rest frame quantity 
when studying the color-density relation, here we only 
look at the observed quantity since the redshift range is 
very small and our main purpose is to see if the density 
dependence of color distributions can still be revealed in 
the photometric redshift sample, rather than quantify¬ 
ing the ’color-density relation’ itself. Galaxies are first 
classified to be red or blue according to their locations in 
the observed Color- Magnitude Diagram (CMD). We use 
g — i = 1.5 and 1.75 as dividing lines to separate blue and 
red galaxies at 0.3 < z < 0.5 and 0.6 < z < 0.8, respec¬ 
tively. Next we bin the galaxies according to their i-band 
apparent magnitude and then compute the percentage of 
red galaxies defined as: 


/red — 


Ared 

-Win ' 


(5) 


where -Win is the total number of galaxies and A/ed is 
the number of red galaxies in each bin. 

We determine the choice of -Wd that yields the largest 
r s for photo-z samples with different photo-z uncertain¬ 
ties using the methodology described in Section 4.2. In 
the case where we study the color-density relation for 
the environment scale corresponding to the 6 th nearest 
neighbor in the 3D space, i.e., pe, it is found that the 
optimized N 2 d = 6, 6, 6, 12 is for the cases with photo- 
z error = 0.00, 0.02(1 + z), 0.04(1 + z) and 0.06(1 + z) 
respectively (see the bottom panel of Figure 10). We 
note that in the case of photo-z error = 0.06(1 + z), al¬ 
though -Wd = 12 is the best choice for optimization, we 
still adopt N 2 d = 6 to show its CMD for convenience 
as there is almost no significant difference between using 
Wd = 6 or Wd = 12 for the optimization. 

The upper-panels of Figure 14 show the CMD for the 
20% most dense environments (red-contour) and the 20% 
least dense environments (blue-contour) for galaxy sam¬ 
ples with different photo-z errors. Here we use the 2D 
projected measurements S 6 and set Wut = ±0.001(1+2), 
±0.02(1 + z), ±0.04(1 + z) and ±0.06(1 + z) for the 
case with photo-z error = 0.00, 0.02(1 + 2 ), 0.04(1 + 2 ) 
and 0.06(1 + 2 ) respectively. All cases are considered for 
< 25 and to* < 25. Here the contours connect points 
with equal pixel-density in the CMD. The lower-panels 
of Figure 14 show the red fraction, / re <j, as a function 
of i-band apparent magnitude for local densities corre¬ 
spond to the 20% most dense (red), 60% - 80% densest 
(orange), 40% - 60% densest (yellow), 20% - 40% dens¬ 
est (green) and 20% least dense (blue). The error bars 
show the 1-a Poisson uncertainty in each bin. 

It is clear that in the simulation, galaxy colors are 
strongly correlated with environment: being redder in 
denser environments. In the case where there is no photo- 
z error, the red and blue contours occupy distinct regions 
in the CMD. This trend is in good agreement with obser¬ 
vational results (Balogh et al. 2004; Cooper et al. 2006, 
2007; Cassata et al. 2007). As the photo-z error increases 
(from left to right), red and blue contours begin to over¬ 
lap. To further quantify the influence of the photo-z un¬ 
certainty on the CMD, we plot the red fraction as a func¬ 
tion of i-band apparent magnitude for galaxies located 
in 5 different density percentiles (bottom panel of Figure 
14). The difference in the red fraction becomes gradu¬ 


ally smaller with increasing photo-z error. Considering 
the case without photo-z error and galaxies rrii > 20, the 
difference in red fraction between the 20% most dense 
and 20% least dense environments ranges from 0.5 to 0.6, 
but decreases to 0.3 - 0.4 in the case of photo-z error = 
0.06(1 + 2 ). Nevertheless, it is still encouraging that even 
in the worst case (photo-z error = 0.06(1 + 2 )), the de¬ 
pendence of the red fraction on galaxy environment can 
still be seen. 

Figure 15 presents similar information to Figure 14, 
but now for the cases with various secondary magnitude 
limits. Here we consider the cases with m* < 25, to* < 23 
and m\ < 21. All the three cases are considered using 
to^ < 25. To remove additional uncertainties due to pro¬ 
jection effects, the density ranking is based on the 3D 
real-space environments, p§. Our results show that the 
environment measured with fainter secondary magnitude 
limits yield a better correlation with galaxy properties. 
This result is somewhat expected because the scales of 
the environment defined by different mf are different for 
a given neighbor N, being greater for brighter mf. If the 
color-density relation is scale dependent, it can lead to 
the dependence on the adopted mf. The degraded color- 
density relation with brighter sample in Figure 15 could 
be due to environmental effects on color being weaker 
with increasing environment scale. Furthermore, fainter 
sample is spatially denser and therefore contains more in¬ 
formation about environment than a sparse sample does. 
Including more galaxies in the density estimate thus also 
help to characterize the environments. We will inves¬ 
tigate the correspondence between galaxy environment 
and dark matter halo mass in the next section. 

5.2. Environments vs. Dark Matter Halo Mass 

Observationally it is found that galaxies located in 
massive halos, such as groups and clusters, are in gen¬ 
eral formed earlier and hence are more evolved, com¬ 
pared to galaxies in the held (Capak et al. 2007). Semi- 
analytic galaxy formation models have also successfully 
reproduced the observed trend (Lemson & Kauffmann 
1999; Benson ct al. 2000; Haas ct al. 2012; Muldrew et al. 
2012). Figure 16 shows the red fraction as a function of 
dark matter halo mass in our mock catalog based on the 
model of Lagos et al. (2012). As can be seen, the red 
fraction increases rapidly toward massive halos, in good 
agreement with observation. 

We now proceed to show the correlation between host 
halo mass and overdensity in order to understand why 
the galaxy environment calculated using a fainter sec¬ 
ondary magnitude limit has a better correlation with 
galaxy color. We adopt a method similar to the one 
used in Muldrew et al. (2012) which is to plot the rela¬ 
tionship between host halo mass and overdensity. The 
upper-panels of Figure 17 show the correlations between 
host halo mass and the 3D overdensity for the galaxy 
samples with different secondary magnitude limits. In 
general we find that even in the case of zero redshift er¬ 
ror, the 3D environment measures are a poor tracer of 
mass for individual objects as revealed by the large scat¬ 
ters between 3D density field and halo mass. Similar to 
what is found by Muldrew et al. (2012), for low IV 3 D a 
galaxy found at high 3D density is actually more likely 
to be in a low-mass halo than in a high-mass one. Fur¬ 
thermore, our results show that for a fixed IV 3 D, the low 
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density region probed using a brighter mf has a wider 
spread in halo mass. In contrast, the low density region 
measured using a fainter ?nf is dominated by galaxies lo¬ 
cated in small halos (< 10 12 Mq) where the red fraction 
drops significantly with decreasing halo mass (see Fig¬ 
ure 16). This is because the scale of the density defined 
by a fainter magnitude limit for a fixed IV 3 D is typically 
smaller and less contaminated by the 2 -halo term, and 
therefore is a better tracer of the halo mass, except for 
very massive halos (> 10 14 Mq). 

This point is further illustrated in the lower panels 
of Figure 17 where we show that smaller IV 3 D yields a 
stronger correlation between overdensity and host halo 
mass than using larger A^d- Therefore the tighter re¬ 
lationship between the galaxy colors and densities com¬ 
puted using a fainter secondary magnitude limit seen in 
Figure 15 can be attributed to the fact that the environ¬ 
ment defined using a fainter sample traces more closely 
with host halo masses. 

Figure 18 shows l + df 13 versus halo mass for the galax¬ 
ies with photo-z errors varying from 0.0 to 0.06(1 + z). 
The overdensity 1 + 5q D increases with host halo mass 
but with a large scatter, as seen in Muldrew et al. 
(2012) which is based on different simulations. Neverthe¬ 
less, the correlation becomes progressively weaker when 
photo-z uncertainty increases. In the case of photo-z = 
0.06(l + ;z), the correlation is almost flat, suggesting that 
the 2D projected density is no longer a good tracer of 
halo mass when photo-z errors are non-negligible. 

5.3. Comparison with Spectroscopic Observation 

In previous sections, we present how the photo-z un¬ 
certainty can have an impact on the measurement of lo¬ 
cal density and discuss how well the 2D projected den¬ 
sity traces the real-space density when adopting different 
choices of the size of velocity (redshift) window, magni¬ 
tude limit, and the IV th nearest neighbor. Furthermore, 
we have also studied how the color-density relation is af¬ 
fected by the presence of photo-z errors and as a function 
of the magnitude limit that is applied to the secondary 
sample. In this section, we further investigate the dif¬ 
ference in the local density measurements between the 
photo-z and spectral-z sample using mock galaxies, by 
taking into account more realistic situations including 
the incompleteness of the spectroscopic sample. 

As we discuss in Section 4, the photo-z uncertainty has 
a strong impact on the correlation between the 2D and 
3D densities. Ideally, the density measurement based on 
spectral-z sample is more reliable. However, the spec¬ 
troscopic observations of galaxies are time-consuming 
and hence are normally limited to a small sample size, 
brighter galaxies, and a lower redshift range. Moreover 
they often suffer from incompleteness due to the limited 
observing time as well as the fiber and/or slit collisions. 
In contrast, the photo-z can be relatively easily obtained 
down to fainter galaxies and out to higher redshifts with 
a much larger size of sample, but with the drawback that 
the redshift resolution is substantially poorer compared 
to spectral-z. Nevertheless, both spectral-z and photo-z 
samples have been used on environment studies (Cooper 
ct al. 2006; Elbaz et al. 2007; Cassata et al. 2007; Quadri 
ct al. 2012). It is thus interesting to investigate to what 
extend the local density from photo-z samples can be 


compared to that of the spectral-z samples. To do so, 
we randomly choose part of the entire spectral-z samples 
from the mock catalogs to simulate the spectral-z sam¬ 
ples with different percentage of completeness. We then 
apply our optimized scheme as introduced in Section 4.2 
to both incomplete spectral-z samples and photo-z sam¬ 
ples to compare their r s . 

Figures 19 - 21 show r s as a function of -/V 3 D for 
spectral-z samples with different completeness of 10 %, 
20%, 40%, 60%, 80% and 100% (denoted by different 
colors). For comparison, we also overplot the results of 
photo-z sample with different photo-z uncertainties pre¬ 
sented in different line styles in the top-left panel. To 
fairly compare r s on the same physical distance scales 
among various cases, r s is calculated using the IV 3 D that 
corresponds to the same N th nearest neighbor in the case 
of 100% complete spectroscopic sample. A zoomed-in 
version on small scales are shown in the top-right panel. 
The difference among the three figures (19 - 21) is the 
secondary magnitude limit applied. For example, in Fig¬ 
ure 19 we consider the case where galaxy environments 
are calculated by using mf < 25.0. It shows that the 
galaxy environments calculated by using an incomplete¬ 
ness spectral-z sample with this deep magnitude selection 
are more reliable than those galaxy environments calcu¬ 
lated by using a complete photo-z sample. 

However, in general it is difficult to obtain spectro¬ 
scopic redshifts for a large sample of very faint galaxies. 
For example, the DEEP2 survey (Newman et al. 2013) 
is limited to R < 24.1 and the zCOSMOS bright sam¬ 
ple (Lilly ct al. 2007) is limited to i = 22.5. Next we 
vary the secondary magnitude limits of the spectroscopic 
sample to see how the trend changes. Here we consider 
the two optimized results, mf < 24.1 and mf < 22.5, 
to roughly mimic the results of DEEP2 sky survey and 
zCOSMOS-bright survey, respectively. Strictly speaking 
the DEEP2 is limited in the i?-band instead of i-band, 
however, here we simply adopt the *-band in order to 
reveal the trend more clearly. Our results show that 
in the case of mf < 24.1, the performance of density 
recovery with photo-z error as low as ~ 0 . 02(1 + z) is 
always worse than that of the spectral-z samples. How¬ 
ever, when the magnitude limit of the spectral-z samples 
decreases to mf < 22.5, the performance becomes com¬ 
parable to the spectral-z sample with 10 % completeness. 
In other words, as the spectral-z sample gets brighter, the 
r s coefficients between the spectral-z and photo-z sam¬ 
ples become closer. However, the difference between their 
r s coefficients gradually becomes larger when we probe a 
larger scale of environment, as described in Section 4.2. 
That is, a deeper photo-z sample can yield similar per¬ 
formance as good as an incomplete, shallower spectral-z 
sample, but this is only restricted to small-scale environ¬ 
ments. 

5.4. Effect of Outliers 

So far the studies on the effect of the photo-z uncer¬ 
tainty on the density measurement are carried out by 
perturbing the redshifts of mock galaxies with a Gaussian 
function. However, this method does not totally mimic 
the realistic case because the photo-z errors may not ex¬ 
actly follow the Gaussian distribution. For example, in 
the cases where there are not enough numbers and/or 
wavelength coverage of bandpasses, the feature of the 


Lyman break (~ 912 angstrom) can be misidentified as 
Balmer break (~ 4000 angstrom) and vice versa, leading 
to a catastrophic failure in the photo-z estimation, the 
so-called ‘redshift outliers’ (Brough et al. 2013). Next 
we study how the outlier rate influences the correlation 
between the 2D and 3D local density measurements. 

To simulate galaxy samples with outliers, we randomly 
choose part of the entire simulation samples according to 
the desired outlier rate, and assign them a new redshift 
randomly between 0 - 2.0. Figure 22 shows the results us¬ 
ing samples with photo-z = 0.02(1 + z) and four different 
percentages of outliers: 5%, 10%, 15% and 20%. Similar 
to Figures 19 - 21, we also mark the results for different 
photo-z uncertainties in the case of 0% outliers with dif¬ 
ferent line styles for comparison. From this figure, we can 
see that r s also strongly depends on the outlier fraction, 
becoming worse as the outlier fraction increases. The ef¬ 
fect is similar to the degradation of photo-z uncertainty 
and the completeness of the sample. For example, for 
the sample with photo-z error = 0.02(1 + z) and outlier 
rate = 10% (green dots), we find its optimized result is 
similar to the result of outlier-free sample with photo-z 
error = 0.04(1 + z). 

Figure 23 shows the CMD for the samples with photo- 
z error = 0.02(1 + z) and four different percentages of 
outliers. The environments are measured by using Eg, 
< 25.0 and rof < 25.0. As it can be seen, in 
the case with 5% outlier, it is comparable to a non¬ 
outlier case with photo-z error between ~ 0.02(1 + z) 
and ~ 0.04(1 + z) and the color-density relation can 
still be revealed. However, as the outlier rate goes up 
to 10%, the density measurements for under-dense en¬ 
vironments (for example, the least 20% (blue) and 20% 
- 40% (green) dense environments) are no longer distin¬ 
guishable, resulting in a weaker color-density relation. 
This is in contrast to the situations with pure photo-z 
errors for which the lowest density curves remain dis¬ 
tinguishable. The outliers have larger effects in lower 
density environments because the change in the density 
measurements are proportionally larger in those regions 
when some fraction of galaxies are scatted inside or out 
the relevant redshift window. 

6. COLOR- DENSITY RELATION FOR 
PAN-STARRS1 DATA 

The main purpose of this work is to understand the sys- 
tematics in the 2D density measurement and its limita¬ 
tion, with the ultimate goal of its application to the ongo¬ 
ing and future large photometric surveys. So far we have 
explored various aspects of the density measurements by 
using mock galaxy catalogs for which the real-space den¬ 
sity is known. We have considered several factors such as 
photo-z uncertainty, magnitude limit, completeness and 
outlier rate that make simulation data as similar to re¬ 
alistic samples as possible. However these factors are 
still not sufficient to imitate the realistic samples. An 
alternative is to compare the results of the overlapping 
samples directly between photo-z and spectral-z surveys. 
We adopt this approach by using the PS1 MD07 photo¬ 
metric redshift catalog (Lin et al. 2014) as it covers the 
well-known EGS field, which has the spectroscopic red- 
shifts from DEEP 2, allowing for a direct comparison of 
the density measurement. 

We first compute the environments using galaxies with 


spectroscopic redshifts and compare these environments 
with those calculated by using photometric redshifts 
from Pan-STARRSl for the same galaxies. For compar¬ 
ison, we also utilize the mock galaxy catalogs described 
in Section 2.1 and perturb their redshifts to simulate the 
photo-z conditions of Pan-STARRSl where the typical 
error is ~ 0.06(1 + z) and the outlier rate is ~ 6%. Fig¬ 
ure 24 shows the scatter plot for galaxies in the EGS 
field (left-panel) and in the simulations (right-panel). In 
the left panel, The 2D and 3D environments are eval¬ 
uated by using samples from Pan-STARRSl (photo-z) 
and DEEP2 (spectral-z) respectively. As it can be seen, 
while we compare the 2D projected and 3D environments 
in the realistic case, the slope of scatter plot is similar to 
simulation result but r s is smaller than the results of sim¬ 
ulation. While this might be explained by the intrinsic 
difference in the galaxy spatial distribution between the 
real Universe and simulations, it is also noticed that the 
galaxies in the MD07/EGS field span a narrower range in 
the 3D overdensity because of the smaller field size such 
that the extreme environments are not well-sampled. As 
a result, the simulated sample includes very dense en¬ 
vironments which are more discernible and easier to be 
recovered compared to intermediate environments. Nev¬ 
ertheless, there still exists a weak correlation in compar¬ 
ison with real data even though their r s coefficients are 
smaller than those of the simulated data. 

To know whether the color-density relation can still 
be revealed in the realistic photo-z sample, in the Figure 
25 we plot the CMD (upper panels) and the red frac¬ 
tions versus i p i magnitude (lower panels) for galaxies 
located in the 20% most dense (red contour) and 20% 
least dense (blue contour) galaxy environments. The red 
fraction is defined as the ratio of the number of galax¬ 
ies with g p i — i p i color redder than 1.5 to that of the 
full sample. For each galaxy in the right-panel of PS1, 
we compute Eg with mf < 24, using the photometric 
redshifts derived in Lin et al. (2014) with the redshift 
range of 0.3 < 2 < 0.5. The left panel of Figure 25, 
which is for comparison, shows the DEEP2 result us¬ 
ing galaxy densities computed by Cooper et al. (2005). 
Their density measurements have been corrected for sev¬ 
eral effects such as the survey edges, redshift precision, 
redshift-space distortion and target selection as described 
in Cooper et al. (2005). Therefore, their measurements 
can be regarded as the ’true’ answer in this comparison. 
The middle panel shows the result calculated with PS1 
photometric redshifts only for galaxies located in the re¬ 
gion overlapping with the EGS field which is ~ 0.5 deg 2 
(~ 1,500 galaxies), while the right panel shows the re¬ 
sult based on the entire PS1/MD07 field of ~ 5 deg 2 
(~ 25, 000 galaxies). To minimize the impact of the edge 
effects, we also exclude galaxies near the survey bound¬ 
aries when showing the color-density relation. Among 
all the three samples, the color-density relation is only 
significantly detected (> 3<r) in the ~ 5 deg 2 PS1 photo- 
z sample. This is because although the photo-z uncer¬ 
tainty in general contaminates the density measurement, 
which leads to some systematics in the color-density re¬ 
lation, the random errors can be largely improved given 
the large volumes probed by a photometric survey. In 
other words, the reduced errors due to the larger sample 
are sensitive enough to allow for the detection of a ’de- 
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graded’ relation between red fraction and environment. 

Furthermore, we also extend our study from low red- 
shift range (0.3 < z < 0.5) to higher redshift range 
(0.6 < z < 0.8). In this redshift bin, g p \ — i v \ = 2.0 
is used to separate blue and red galaxies. Similarly we 
first show the color-density relation at redshift range 
0.6 < z < 0.8 using our simulated dataset (Figure 26) 
and PS1 samples (Figure 27, ~ 45,000 galaxies). As 
shown in the right panel of Figure 27, the difference 
in the red fraction between two extremely environments 
is still detectable at ~ 2 — 3er level. The Kolmogorov- 
Smirnov test (Andrade et al. 2001) on the color distribu¬ 
tions for galaxies with 21 < i p i < 23 located in the most 
20% and least 20% density percentiles returns a value of 
p « 0.1%, rejecting the null hypothesis that the color 
distributions of galaxies are drawn from the same popu¬ 
lation. 

7. CONCLUSIONS 

In this work, we have studied how the 2D projected 
environment correlates with the 3D real-space environ¬ 
ment. Using the Durham mock galaxy catalogs, we inves¬ 
tigate various parameters in measuring the 2D projected 
environment and find the best parameters which maxi¬ 
mize the Spearman’s rank correlation coefficient, defined 

as r s = 1 — ^ j , which is a quantity for quantifying 
the correlation between 3D real-space and 2D projected 
overdensities. When applying the N th nearest neighbor 
method to the PS1 photo-z sample, we show that color- 
density relation can still be revealed despite of the sizable 
photo-z errors inherent in the data. Our main conclu¬ 
sions are as follows. 

(i) The correlation between the 2D projected and 3D 
real-space overdensity is sensitive to the photo-z uncer¬ 
tainty. Smaller N 3 d is recommended for photo-z sam¬ 
ples to achieve a better correlation (larger r s ) between 
2D projected and 3D real-space environments. 

(ii) As the scale of 3D real-space environment increases, 
the r s derived by using spectral-z and photo-z samples 
show the opposite trend: the correlation becomes grad¬ 
ually stronger for the spectral-samples but worse for the 
photo-z samples. 

(iii) The 2D projected environment measurements are 
less sensitive to the redshift interval (14ut)- The redshift 
interval comparable to the photo-z uncertainty yields 2D 
projected overdensity that reasonably traces the real- 
space density. 

(iv) The magnitude limit should also be considered 
when computing local densities of galaxies. The 2D envi¬ 
ronments measured with fainter magnitude limits yield 
better correlation with the 3D real-space environments 
derived from the same limiting magnitude sample for a 
fixed JV 3 d • In addition, the color-density relation is more 
prominent if the density is measured using fainter mag¬ 
nitude limits for a fixed A^d- This is because the over¬ 
density computed with fainter magnitudes proves smaller 
scales with the same A^ 2 d, and traces the hosting halo 
mass of galaxies better. 

(v) Considering the case calculated by using galaxy 
samples at 0.3 < z < 0.5 with m ? < 25.0 and mf < 25.0, 
the recovering performance of small-scale environments 
for photometric redshift samples with redshift uncer¬ 
tainty of 0.02(1 + z) is roughly comparable to that for 


shallower i ~ 22.5 spectroscopic redshift samples with ~ 
10% completeness. In addition, the effect of catastrophic 
failures in the photo-z measurements on the density mea¬ 
surement is similar to that of the photo-z errors. 

(vi) Using Durham mock galaxy catalogs in the 
redshift range of 0.3 < z < 0.5, we show that the 
density-dependent red fraction can still be revealed in 
photometric redshift samples with photo-z uncertainty 
up to 0.06(1 + z). Similarly with photo-z sample 
from PS1, we show that the color-density relation is 
also present in the sample whose photo-z uncertainty 
is ~ 0.06(1 + z) and the outlier rate is ~ 6%, but 
the significance strongly depends on the sample size. 
Based on the results of PS1 in the two redshift bins 
(0.3 < ^ < 0.5 and 0.6 < z < 0.8), we recommend 
that the survey size should at least exceed ~ 5 deg 2 
in order to yield > 3cr results. Larger fields will be 
required in order to reduce the Poisson errors if going 
to higher-redshifts as the color-density relation is less 
prominent and the number density of galaxies is reduced. 
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APPENDIX 

A. 7V 3D VS. N 2D IN ENVIRONMENT 
MEASUREMENTS 

As mentioned in Section 4.2, the optimized IV 2 D is only 
half the value of iV 3 d ■ The ratio of the two quantities in 
fact depends on the size of the redshift interval when 
computing the 2D density. Figure 28 shows the opti¬ 
mized scheme for the case with different size of I4ut ■ We 
consider the sample with photo-z error = 0.04(1 + 2 ) and 
calculate the environment with different sizes of I4ut = 
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±0.005(1+2), ±0.02(1+2), ±0.04(1+2) and ±0.06(1+2), 
corresponding to red, green, blue and cyan dots respec¬ 
tively. As it can be seen, for the cases with size of 14 u t = 
±0.02(1 + 2 ), ±0.04(1 + 2 ) and ±0.06(1 + 2 ), their opti¬ 
mized results are quite similar although the best choices 
of N- 2 B are different. The ratio of the optimized N 2 r> to 
IV 3 D, roughly, is 1:10, 1:5, 1:2.5 and 1:1.7 for the cases 
with V cut = ±0.005(1 + 2 ), ±0.02(1 + 2 ), ±0.04(1 + 2 ) and 
±0.06(1 + 2 ), respectively. The N 2 d in the case of 14 ut 
= ±0.06(1 + 2 ) is tripled compared to -/V 2 d in the case of 
Tc Ut = ±0.02(1 + 2 ). Next we investigate how strong the 


effect of N 2 d is on r s . In Figure 29 we plot the differ¬ 
ence between the two r s , one is evaluated by using best 
choice of N 2 d, and another is evaluated by _/V 2 d = -/V 3D , 
normalized by the former, as a function of 7V 3 d- As can 
be seen, their maximum difference is only ~ 9%, 3% and 
3% in the case with 14 ut = ±0.02(1 + 2 ), ±0.04(1 + 2 ) 
and ±0.06(1 + 2 ) respectively. This suggests that if the 
size of V cut comparable to the photo-z uncertainty, the 
2D local density measured with jV 2 d ~ IV 3 d could be as 
good as that derived with the optimized N 2 t>. 
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Fig. 1.— Spatial distribution of mock galaxies projected onto the plane of the sky with redshifts perturbed corresponding to different 
photo-z errors, 0.00, 0.02(1 + z), 0.04(1 + z) and 0.06(1 + z) at 0.3 < z p hoto < 0.35. 
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Fig. 2.— Spatial distribution of mock galaxies seen in one line-of-sight projection and redshift. A 0.05 degrees interval in DEC is used 
when projecting galaxies onto the plane. The redshift of galaxies are perturbed according to different photo-z errors, 0.00, 0.02(1 + z), 
0.04(1 + z) and 0.06(1 + z). The red rectangle indicates the redshift range 0.3 < 2photo < 0.35 used in Figure 1. 
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Fig. 3.— Spatial distribution of mock galaxies seen in one line-of-sight projection and redshift. A 0.05 degrees interval in RA is used 
when projecting galaxies onto the plane. The redshift of galaxies are perturbed according to different photo-z errors, 0.00, 0.02(1 + z), 
0.04(1 + z) and 0.06(1 + z). The red rectangle indicates the redshift range 0.3 < z p hoto < 0.35 used in Figure 1. 
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Fig. 4.— The median IV th nearest-neighbor distance as a function of N$y> for various choices of the magnitude limit of the secondary 
sample (mf). The shadows of different colors show the range between the minimum and maximum ;Y lh nearest-neighbor distance. 
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Fig. 5. — The scatter plot of the 3D real-space overdensity, 1 versus 2D projected overdensity: 1 + Sq D (upper-left panel), 1 + 5^ 

(upper-right panel), 1 5 qq (lower-left panel) and 1 + Sqq* (lower-right panel) in log-scale. The numbers printed in the bottom-left of each 

panel indicate the r s coefficient. The black dash-dot lines represent the best fit to the data points, the red dash-dot lines represent the 
one-to-one relation and the contours show the regions of constant galaxy number. 
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Fig. 6.— The scatter plot of the 3D real-space overdensity, 1 + 8q D , versus 2D projected overdensity, 1 -|- Sq D , with various velocity cuts: 
Vcut = =b0.005(1 + z), d=0.02(l + z), d=0.04(l -\-z) and ±0.06(1+ z) over the redshift interval 0.3 < 2 : < 0.5. All cases are considered by using 
galaxy samples with photo-z error = 0.04(1 ± 2 ), m| < 25 and m? < 25. The numbers printed in the bottom-left of each panel indicate the 
r s coefficient. The black dash-dot lines represent the best fit to the data points, the red dash-dot lines represent the one-to-one relation 
and the contours show the regions of constant galaxy number. 
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Fig. 7.— The scatter plot of the 3D real-space overdensity, 1 + 6q D , versus 2D projected overdensity, 1 + S^ D , with various secondary 
magnitude limit in 2D measurements (series of row panels, from left to right: m| < 21, m| < 23 and m| < 25) and 3D measurements 
(series of column panels, from bottom to top: mf < 21, mf < 23 and raf < 25) over the redshift interval 0.3 < 2 : < 0.5. All cases are 
considered by using galaxy samples with photo-z error = 0.02(1 + z), V cu t = ±0.02(1 + z) and m? < 25. The numbers printed in the 
bottom-left of each panel indicate the r s coefficient. The black dash-dot lines represent the best fit to the data points, the red dash-dot 
lines represent the one-to-one relation and the contours show the regions of constant galaxy number. 
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Fig. 8. — The scatter plot of the 3D real-space overdensity, 1 + Sq D versus 2D projected overdensity, 1 + 8q D , with different photo-z 
errors over the redshift interval 0.3 < z < 0.5. All cases are considered by using galaxy samples with photo-z error = 0.00, 0.02(1 + 2 :), 
0.04(1 + 2 :) and 0.06(1 + 2 :) and Vc U t = ±0.001(1 + 2 :), ±0.02(1 + 2 :), ±0.04(1 + 2 :) and ±0.06(1 + 2 :) respectively. The primary and secondary 
magnitude limits are m? < 25 and m| < 25. The numbers printed in the bottom-left of each panel indicate the r s coefficient. The black 
dash-dot lines represent the best fit to the data points, the red dash-dot lines represent the one-to-one relation and the contours show the 
regions of constant galaxy number. 
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Fig. 10.— The largest r s (upper-panel) obtained by varying A^d and the corresponding choice of A^d (lower-panel) that yields the best 
correlation between the real-space and 2D projected environments for different choices of Af 2 D as a function of from mock galaxy 
catalogs. Different color-dots correspond to samples with different photo-z errors: 0.0, 0.02(1 + z), 0.04(1 + z) and 0.06(1 + z). 






20 


<n 

b~ 


0.8 


0.6 


0.4 


0.2 


2D: 

S 

< 

2$ 

& 

3D: 

S 

m i 

< 

25 

2D: 

m- s 

< 

23 

& 

3D: 

s 

m. 

< 

25 

2D: 

m : s 

< 

25 

& 

3D: 

s 

m- 

< 

25 


0.3 < z < 0.5 
mj P < 25 

zp_error = 0.02(1 +z) 


* ' ’ ’ : 5 • ::::rsss 


1 


10 


100 



3D 

Fig. 11.— The largest r s (upper-panel) obtained by varying A^d and the corresponding choice of AT 2 d (lower-panel) that yields the best 
correlation between the real-space and 2D projected environments for different choices of AT 2 d as a function of from mock galaxy 
catalogs. Here the secondary magnitude limit in 3D measurement is set to m| = 25. Different color-dots correspond to samples with 
different secondary magnitude limits in 2D measurements: ra| < 21, m| < 23 and m| < 25. 
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Fig. 12.— Similar to Figure 11 but the secondary magnitude limit in 3D measurement is set to m| = 23. 
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Fig. 13.— Similar to Figure 11 but the secondary magnitude limit in 3D measurement is set to m| =21. 
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Fig. 14. — Upper panels: color-magnitude diagrams for galaxies in the 20% most dense (red contour) and 20% least dense (blue contour) 
environments with different photo-z errors (from left to right: 0, 0.02(1 + z), 0.04(1 + z) and 0.06(1 + z)). Lower panels: the red fraction, 
/red? as a function of z-band apparent magnitude with different percentage of density levels: the 20% most dense (red), 60% - 80% densest 
(orange), 40% - 60% densest (yellow), 20% - 40% densest (green) and 20% least dense (blue). The error bars are given by Poisson statistics, 
and the contours show the regions of constant galaxy number. 
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Fig. 15. — Upper panels: color-magnitude diagrams for galaxies in the 20% most dense (red contour) and 20% least dense (blue contour) 
real-space environments with different secondary magnitude limits (from left to right: m| < 25, m| < 23 and m| < 21). Lower panels: the 
red fraction, f re d, as a function of z-band apparent magnitude with different percentage of density levels: the 20% most dense (red), 60% 
- 80% densest (orange), 40% — 60% densest (yellow), 20% - 40% densest (green) and 20% least dense (blue). The error bars are given by 
Poisson statistics, and the contours show the regions of constant galaxy number. 
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Fig. 17. — Relationship between host halo mass and galaxy overdensity, 1 + <5^° assuming no redshift error, with three different secondary 
magnitude limits (series of top panels, from left to right: m| < 25, m| < 23 and m| < 21) and choices of Nsd (series of bottom panels, 
from left to right: A^ 3 d = 6, N^d = 30 and IV 3D — 60). The contours show the regions of constant galaxy number. 
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Fig. 19.— The largest r s obtained by varying N 2 D, f° r large choice of JVjjD (top-left panel) and small choice of (top-right panel), 
and the corresponding choice of JV 2 D (series of bottom panels) that yields the best correlation between the real-space and 2D projected 
environments for different choices of A^d as a function of from mock galaxy catalogs. These cases in which the redshift uncertainty 
is zero, m? < 25.0 and m| < 25.0 with a wide range of sampling rates. Different colors are for samples with different sampling rate 
(namely, spectroscopic completeness): 10%, 20%, 40%, 60%, 80% and 100%, and different line-styles represent cases with different photo-z 
uncertainties as shown in Figure 10. 
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Fig. 20.— Similar to Figure 19 but with m| < 24.1 in the case of the redshift uncertainty equal to zero. 
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Fig. 21.— Similar to Figure 19 but with mf < 22.5 in the case of the redshift uncertainty equal to zero. 
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Fig. 22.— The largest r a (upper-panel) obtained by varying AOi , and the corresponding choice of A0n (lower-panel) that yields the best 
correlation between the real-space and 2D projected environments for different choices of A0o as a function of IV 3 D from mock galaxy 
catalogs. Different colors represent cases with various outlier rates in the case of photo-z uncertainty equal to 0.02(1 + z), and different 
line-styles represent cases with different photo-z uncertainties as shown in Figure 10. 
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Fig. 23.— Upper panels: color-magnitude diagrams for galaxies in the 20% most dense (red contour) and 20% least dense (blue contour) 
environments with different percentage of outlier rate (from left to right: 5%, 10%, 15% and 20%). Lower panels: the red fraction, / re d, as 
a function of z-band apparent magnitude with different percentage of density levels: the 20% most dense (red), 60% - 80% densest (orange), 
40% - 60% densest (yellow), 20% - 40% densest (green) and 20% least dense (blue). The error bars are given by Poisson statistics, and 
the contours show the regions of constant galaxy number. 
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Fig. 24.— Left-panel: The scatter plot of the 3D real-space overdensity, 1 + Sq D versus 2D projected overdensity, 1 + Sq D , with galaxy 
samples in the EGS field. The 2D and 3D environments are calculated by using redshifts from Pan-STARRSl and DEEP2 respectively. 
Right-panel: similar to the left panel but using mock galaxy catalogs for the case with real-space overdensity and case with photo-z error 
= 0.06(1 + z) and 6% outliers. The primary and secondary magnitude limits are considered by using m? < 25 and m| < 25. The numbers 
printed in the bottom-left of each panel indicate the r s coefficient. The black dash-dot lines represent the best fit to the data points, the 
red dash-dot lines represent the one-to-one relation and the contours show the regions of constant galaxy number. 
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Fig. 25.— Upper panels: the color-magnitude diagrams for galaxies in the 20% most dense (red contour) and 20% least dense (blue 
contour) galaxy environments with different datasets over the redshift interval 0.3 < z < 0.5 (Left: spectral-z sample from DEEP2 in the 
EGS field; Mid: PS1 photo-z sample in the overlapping region with the EGS field; Right: PS1 photo-z sample with ~ 5 deg 2 ). Lower 
panels: the red fraction, / re d, as a function of z-band apparent magnitude in the 20% most dense (red) and 20% least dense (blue) galaxy 
environments. The error bars are given by Poisson statistics, and the contours show the regions of constant galaxy number. 
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Fig. 26. — Similar to Figure 14, but for the redshift range of 0.6 < 2: < 0.8. 
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Fig. 28.— The largest r s (upper-panel) obtained by varying A^d and the corresponding choice of A^d (lower-panel) that yields the best 
correlation between the real-space and 2D projected environments for different choices of AT 2 d as a function of IV 3 D from mock galaxy 
catalogs. Different color-dots correspond to samples with different V cu t = ±0.005(1 ± z ), ±0.02(1 ± z), ±0.04(1 ± z) and ±0.06(1 ± z). 
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Fig. 29.~ The r s difference between the two cases, optimized AT 2 d and -/V 2 r) — IV 3 D, normalized by the former. Red, green, blue and 
cyan colors are for V cu t = ±0.005(1 ± z ), ±0.02(1 ± z), ±0.04(1 + z) and ±0.06(1 ± z) respectively. 




